
    SchNet - a deep learning architecture for molecules and materials

    Deep learning has led to a paradigm shift in artificial intelligence, including web, text and image search, speech recognition, as well as bioinformatics, with growing impact in chemical physics. Machine learning in general and deep learning in particular are ideally suited for representing quantum-mechanical interactions, enabling the modelling of nonlinear potential-energy surfaces or enhancing the exploration of chemical compound space. Here we present the deep learning architecture SchNet, which is specifically designed to model atomistic systems by making use of continuous-filter convolutional layers. We demonstrate the capabilities of SchNet by accurately predicting a range of properties across chemical space for molecules and materials, where our model learns chemically plausible embeddings of atom types across the periodic table. Finally, we employ SchNet to predict potential-energy surfaces and energy-conserving force fields for molecular dynamics simulations of small molecules, and perform an exemplary study of the quantum-mechanical properties of C20-fullerene that would have been infeasible with regular ab initio molecular dynamics.
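
    The continuous-filter convolution at the heart of SchNet can be sketched compactly: filters are generated from interatomic distances (expanded in Gaussian radial basis functions) by a small filter-generating network, multiplied element-wise with neighbour features, and summed over neighbours. The PyTorch sketch below is illustrative only; the layer sizes, RBF expansion and aggregation are assumptions, not the published SchNet implementation.

import torch
import torch.nn as nn

class CFConv(nn.Module):
    def __init__(self, n_features=64, n_rbf=20, cutoff=5.0):
        super().__init__()
        # Centers of the Gaussian radial basis functions used to expand distances.
        self.register_buffer("centers", torch.linspace(0.0, cutoff, n_rbf))
        self.gamma = 10.0
        # Filter-generating network: maps an expanded distance to a filter vector.
        self.filter_net = nn.Sequential(
            nn.Linear(n_rbf, n_features), nn.Tanh(),
            nn.Linear(n_features, n_features),
        )

    def forward(self, x, distances):
        # x: (n_atoms, n_features) atom embeddings
        # distances: (n_atoms, n_atoms) pairwise interatomic distances
        rbf = torch.exp(-self.gamma * (distances.unsqueeze(-1) - self.centers) ** 2)
        filters = self.filter_net(rbf)                 # (n_atoms, n_atoms, n_features)
        # Continuous-filter convolution: weight neighbour features by
        # distance-dependent filters and aggregate over neighbours.
        return (x.unsqueeze(0) * filters).sum(dim=1)

# Toy usage: 5 atoms with random embeddings and random pairwise distances.
x = torch.randn(5, 64)
d = torch.rand(5, 5) * 5.0
print(CFConv()(x, d).shape)    # torch.Size([5, 64])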

    Inverse Classification for Comparison-based Interpretability in Machine Learning

    In the context of post-hoc interpretability, this paper addresses the task of explaining the prediction of a classifier when no information is available about the classifier itself or about the processed data (neither the training nor the test set). It proposes an instance-based approach whose principle consists in determining the minimal changes needed to alter a prediction: given a data point whose classification must be explained, the proposed method identifies a close neighbour that is classified differently, where the closeness definition integrates a sparsity constraint. This principle is implemented through observation generation in the Growing Spheres algorithm. Experimental results on two datasets illustrate the relevance of the proposed approach, which can be used to gain knowledge about the classifier. Comment: preprint.
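
    To make the principle concrete, here is a minimal sketch of a growing-spheres-style search: candidates are sampled in spheres of increasing radius around the instance until one is classified differently, and the closest such neighbour is returned as the explanation. The radius schedule, the sample count and the omission of the paper's sparsity (feature-reduction) step are simplifications of this illustration.

import numpy as np

def growing_spheres_counterfactual(predict, x, radius_step=0.1, n_samples=500,
                                   max_radius=10.0, seed=None):
    rng = np.random.default_rng(seed)
    original_label = predict(x.reshape(1, -1))[0]
    radius = radius_step
    while radius <= max_radius:
        # Sample uniformly inside a sphere of the current radius around x.
        directions = rng.normal(size=(n_samples, x.size))
        directions /= np.linalg.norm(directions, axis=1, keepdims=True)
        radii = radius * rng.random(n_samples) ** (1.0 / x.size)
        candidates = x + directions * radii[:, None]
        labels = predict(candidates)
        changed = candidates[labels != original_label]
        if len(changed):
            # Return the closest candidate whose prediction differs.
            return changed[np.argmin(np.linalg.norm(changed - x, axis=1))]
        radius += radius_step
    return None  # no differently-classified neighbour found within max_radius

# Toy usage with a hypothetical threshold classifier on 2-D points.
predict = lambda X: (X[:, 0] + X[:, 1] > 1.0).astype(int)
print(growing_spheres_counterfactual(predict, np.array([0.2, 0.2]), seed=0))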

    Scalable and Interpretable One-class SVMs with Deep Learning and Random Fourier features

    The one-class support vector machine (OC-SVM) has long been one of the most effective anomaly detection methods and is extensively adopted in both research and industrial applications. Its biggest remaining issue, however, is its limited capability to operate on large and high-dimensional datasets due to optimization complexity. This problem can be mitigated by dimensionality reduction techniques such as manifold learning or autoencoders; however, previous work often treats representation learning and anomaly prediction separately. In this paper, we propose the autoencoder-based one-class support vector machine (AE-1SVM), which brings the OC-SVM, with the aid of random Fourier features to approximate the radial basis kernel, into the deep learning context by combining it with a representation learning architecture and jointly exploiting stochastic gradient descent to obtain end-to-end training. Interestingly, this also opens up the possible use of gradient-based attribution methods to explain the decision making for anomaly detection, which has long been challenging as a result of the implicit mappings between the input space and the kernel space. To the best of our knowledge, this is the first work to study the interpretability of deep learning in anomaly detection. We evaluate our method on a wide range of unsupervised anomaly detection tasks, in which our end-to-end training architecture achieves performance significantly better than previous work using separate training. Comment: Accepted at the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD).
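
    The combination described above can be sketched as an encoder, a fixed random Fourier feature map approximating the RBF kernel, and a linear one-class SVM objective, all trained end to end with SGD. The class name, layer sizes, loss form and the omission of any reconstruction term are assumptions of this illustration, not the AE-1SVM implementation.

import math
import torch
import torch.nn as nn

class AEOneClassSketch(nn.Module):
    def __init__(self, in_dim=30, latent_dim=8, n_rff=64, gamma=1.0, nu=0.1):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, 16), nn.ReLU(),
                                     nn.Linear(16, latent_dim))
        # Fixed random projection and phase for the RFF approximation of the RBF kernel.
        self.register_buffer("W", torch.randn(latent_dim, n_rff) * math.sqrt(2 * gamma))
        self.register_buffer("b", 2 * math.pi * torch.rand(n_rff))
        self.w = nn.Parameter(0.01 * torch.randn(n_rff))    # linear OC-SVM weights
        self.rho = nn.Parameter(torch.zeros(()))             # OC-SVM offset
        self.nu = nu

    def features(self, x):
        z = self.encoder(x)                                   # learned representation
        return math.sqrt(2.0 / self.w.numel()) * torch.cos(z @ self.W + self.b)

    def loss(self, x):
        # Soft one-class SVM objective on top of the encoded, RFF-mapped data.
        score = self.features(x) @ self.w - self.rho
        hinge = torch.clamp(-score, min=0.0).mean()
        return 0.5 * (self.w ** 2).sum() - self.rho + hinge / self.nu

# Toy end-to-end training on random "normal" data with plain SGD.
model = AEOneClassSketch()
opt = torch.optim.SGD(model.parameters(), lr=1e-2)
data = torch.randn(256, 30)
for _ in range(200):
    opt.zero_grad()
    model.loss(data).backward()
    opt.step()
print(model.loss(data).item())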

    Interpreting random forest classification models using a feature contribution method

    Model interpretation is one of the key aspects of the model evaluation process. The explanation of the relationship between model variables and outputs is relatively easy for statistical models, such as linear regressions, thanks to the availability of model parameters and their statistical significance. For “black box” models, such as random forests, this information is hidden inside the model structure. This work presents an approach for computing feature contributions for random forest classification models. It allows the influence of each variable on the model prediction to be determined for an individual instance. By analysing feature contributions for a training dataset, the most significant variables can be determined, and their typical contributions towards predictions made for individual classes, i.e. class-specific feature contribution “patterns”, can be discovered. These patterns represent the standard behaviour of the model and allow for an additional assessment of model reliability on new data. Interpretation of feature contributions for two UCI benchmark datasets shows the potential of the proposed methodology. The robustness of the results is demonstrated through an extensive analysis of feature contributions calculated for a large number of generated random forest models.
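
    The idea of feature contributions can be illustrated with a small reimplementation: walk the decision path of each tree, attribute the change in the node's class distribution at every split to the feature tested there, and average over all trees in the forest. This is an illustrative sketch on a standard dataset, not the authors' exact procedure.

import numpy as np
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

def tree_contributions(tree, x):
    t = tree.tree_
    node = 0
    probs = t.value[node][0] / t.value[node][0].sum()    # class distribution at root
    contrib = np.zeros((x.size, probs.size))
    while t.children_left[node] != -1:                   # descend until a leaf is reached
        feat = t.feature[node]
        node = (t.children_left[node] if x[feat] <= t.threshold[node]
                else t.children_right[node])
        new_probs = t.value[node][0] / t.value[node][0].sum()
        contrib[feat] += new_probs - probs               # credit the change to the split feature
        probs = new_probs
    return contrib

X, y = load_iris(return_X_y=True)
forest = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)
contribs = np.mean([tree_contributions(est, X[0]) for est in forest.estimators_], axis=0)
print(np.round(contribs, 3))   # per-feature, per-class contributions for one instance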

    Adversarial Robustness on In- and Out-Distribution Improves Explainability

    Neural networks have led to major improvements in image classification, but they suffer from being non-robust to adversarial changes, from unreliable uncertainty estimates on out-distribution samples, and from their inscrutable black-box decisions. In this work we propose RATIO, a training procedure for Robustness via Adversarial Training on In- and Out-distribution, which leads to robust models with reliable and robust confidence estimates on the out-distribution. RATIO has generative properties similar to adversarial training, so that visual counterfactuals produce class-specific features. While adversarial training comes at the price of lower clean accuracy, RATIO achieves state-of-the-art l2-adversarial robustness on CIFAR10 and maintains better clean accuracy.
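
    A hedged sketch of what a training step in this spirit might look like: PGD-style l2 adversarial training on labelled in-distribution batches, combined with pushing the confidence on adversarially perturbed out-distribution batches towards the uniform distribution. The attack settings, the KL term and the loss weighting are assumptions of this illustration, not the exact RATIO objective; the step expects NCHW image batches and a classifier returning raw logits.

import torch
import torch.nn.functional as F

def pgd_l2(model, x, loss_fn, eps=0.5, steps=5, alpha=0.2):
    """Projected gradient ascent on loss_fn within an l2 ball of radius eps (NCHW input)."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        grad, = torch.autograd.grad(loss_fn(model(x + delta)), delta)
        delta = delta + alpha * grad / (grad.flatten(1).norm(dim=1).view(-1, 1, 1, 1) + 1e-12)
        norm = delta.flatten(1).norm(dim=1).view(-1, 1, 1, 1) + 1e-12
        delta = (delta * (eps / norm).clamp(max=1.0)).detach().requires_grad_(True)
    return (x + delta).detach()

def ratio_style_step(model, opt, x_in, y_in, x_out, n_classes=10, lam=1.0):
    # Worst-case in-distribution points: maximize the classification loss.
    x_in_adv = pgd_l2(model, x_in, lambda out: F.cross_entropy(out, y_in))
    # Worst-case out-distribution points: maximize the predicted confidence.
    x_out_adv = pgd_l2(model, x_out, lambda out: out.log_softmax(1).max(1).values.mean())
    opt.zero_grad()
    uniform = torch.full((x_out.size(0), n_classes), 1.0 / n_classes, device=x_out.device)
    loss = F.cross_entropy(model(x_in_adv), y_in) \
        + lam * F.kl_div(F.log_softmax(model(x_out_adv), dim=1), uniform, reduction="batchmean")
    loss.backward()
    opt.step()
    return loss.item()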

    ICIE 1.0: a novel tool for interactive contextual interaction explanations

    With the rise of new laws around privacy and awareness, explanation of automated decision making becomes increasingly important. Nowadays, machine learning models are used to aid experts in domains such as banking and insurance to find suspicious transactions and approve loans and credit card applications. Companies using such systems have to be able to provide the rationale behind their decisions; blindly relying on the trained model is not sufficient. There are currently a number of methods that provide insights into models and their decisions, but often they are good at showing either global or local behaviour, not both. Global behaviour is often too complex to visualize or comprehend, so approximations are shown, and visualizing local behaviour is often misleading as it is difficult to define what local exactly means (i.e. our methods don’t “know” how easily a feature value can be changed: which ones are flexible and which ones are static). We introduce the ICIE framework (Interactive Contextual Interaction Explanations), which enables users to view explanations of individual instances under different contexts. We will see that various contexts for the same case lead to different explanations, revealing different feature interactions.
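
    The notion of context-dependent explanations can be illustrated with a toy example: the same perturbation-based importance score is computed twice, but each "context" restricts which features are treated as changeable, so the resulting explanations differ. Everything below (the scoring rule, the chosen contexts, the model) is a hypothetical illustration, not the ICIE 1.0 implementation.

import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier

X, y = load_breast_cancer(return_X_y=True)
model = GradientBoostingClassifier(random_state=0).fit(X, y)

def contextual_importance(model, X, x, mutable, n=200, seed=0):
    """Importance of each *mutable* feature: mean absolute change in the predicted
    probability when that feature is resampled from the data, with all others fixed."""
    rng = np.random.default_rng(seed)
    base = model.predict_proba(x.reshape(1, -1))[0, 1]
    scores = {}
    for f in mutable:
        samples = np.tile(x, (n, 1))
        samples[:, f] = rng.choice(X[:, f], size=n)        # perturb one feature at a time
        scores[f] = np.abs(model.predict_proba(samples)[:, 1] - base).mean()
    return scores

x = X[0]
context_a = [0, 1, 2, 3]        # hypothetical context: only these features may change
context_b = [10, 11, 12, 13]    # a different context yields a different explanation
print(contextual_importance(model, X, x, context_a))
print(contextual_importance(model, X, x, context_b))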

    Explanation-Based Weakly-Supervised Learning of Visual Relations with Graph Networks

    Visual relationship detection is fundamental for holistic image understanding. However, the localization and classification of (subject, predicate, object) triplets remain challenging tasks, due to the combinatorial explosion of possible relationships, their long-tailed distribution in natural images, and an expensive annotation process. This paper introduces a novel weakly-supervised method for visual relationship detection that relies on minimal image-level predicate labels. A graph neural network is trained to classify predicates in images from a graph representation of detected objects, implicitly encoding an inductive bias for pairwise relations. We then frame relationship detection as the explanation of such a predicate classifier, i.e. we obtain a complete relation by recovering the subject and object of a predicted predicate. We present results comparable to recent fully- and weakly-supervised methods on three diverse and challenging datasets: HICO-DET for human-object interactions, Visual Relationship Detection for generic object-to-object relations, and UnRel for unusual triplets, demonstrating robustness to non-comprehensive annotations and good few-shot generalization. Part of ISBN 9783030586034.
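
    To make the framing concrete, here is a toy sketch of the pipeline: a pairwise scorer over detected objects is trained from image-level predicate labels via pooling, and a relation is recovered afterwards by asking which (subject, object) pair most supports the predicted predicate. The max-pooling readout stands in for a proper attribution method, and the architecture and sizes are illustrative assumptions rather than the paper's model.

import torch
import torch.nn as nn

class PredicateGraphNet(nn.Module):
    """Toy pairwise scorer: every ordered pair of detected objects gets predicate
    logits, and the image-level prediction max-pools over all pairs."""
    def __init__(self, obj_dim=256, n_predicates=10):
        super().__init__()
        self.edge_mlp = nn.Sequential(nn.Linear(2 * obj_dim, 128), nn.ReLU(),
                                      nn.Linear(128, n_predicates))

    def forward(self, obj_feats):
        n = obj_feats.size(0)
        # All ordered (subject, object) pairs, excluding self-pairs.
        idx = [(i, j) for i in range(n) for j in range(n) if i != j]
        pairs = torch.stack([torch.cat([obj_feats[i], obj_feats[j]]) for i, j in idx])
        edge_logits = self.edge_mlp(pairs)                 # (n_pairs, n_predicates)
        image_logits = edge_logits.max(dim=0).values       # weak, image-level output
        return image_logits, edge_logits, idx

# "Explaining" a predicted predicate = recovering the pair that supports it most.
model = PredicateGraphNet()
obj_feats = torch.randn(4, 256)                            # features of 4 detected objects
image_logits, edge_logits, idx = model(obj_feats)
predicate = image_logits.argmax().item()
subject, obj = idx[edge_logits[:, predicate].argmax().item()]
print(f"predicate {predicate} supported by subject {subject} and object {obj}")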